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Abstract 

Wc introduce a new notion of pathwise strategies for stochastic differential games. This 
allows us to give a correct meaning to some statement asserted in [2]. 

1 Introduction. 

In this short note we develop a new notion of strategies for stochastic differential games. We 
present our concept in the framework of two-player, zero-sum, differential games. The players 
are labelled Player I and Player II, Player I being minimizing while Player II is maximizing. We 
assume that the players have perfect monitoring, i.e., they observe each other's action perfectly. 
The state of the game satisfies a stochastic differential equation, that we assume driven by a 
Brownian motion, and which is controlled by both players. 

Nonanticipative strategies for deterministic differential games have been introduced in a 
series of papers by Varaiya [7J, Roxin [6], Elliott and Kalton in this framework a strategy 
(for Player I) is a nonanticipative map from Player II's set of controls to Player I's. Adapting 
this idea to stochastic differential games lead Fleming and Souganidis in their pioneering work 
[I] to define a notion of strategy (again for Player I) as nonanticipative map from the set 
of adapted controls of Player II to the set of adapted controls of Player I. This approach has 
subsequently been used by most authors working on stochastic differential games, sometimes 
with some variants: see, e.g., Buckdahn-Li pQ. 

If the notion of nonanticipative strategies makes perfectly sense for deterministic differen- 
tial games — because the players indeed observe each other's action — one can object that, for 
stochastic ones, the players do not actually observe their opponent's adapted control, but just a 
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realization of this control in the actual state of the world: more precisely, assume that Player II 
plays the control v = v(t,u). Then, in the state u and at time t, Player I has not observed the 
full map (s,u/) — > w(s,o;| ot j), but only the map s — > v(t,u}[ 0)t ]). For this reason, the authors of 
the present paper introduced in [2] a notion of pathwise nonanticipative strategies, formalizing 
the fact that the players only observe their opponent's action in the actual state, as well as the 
path of the resulting solution of the stochastic differential equation. 

Unfortunately handling such pathwise strategies is quite subtle and, in [2], we overlooked 
some difficulties (we explain this in details in section [3]). In the present paper we show how to 
overcome this problem. We still keep the flavor of pathwise strategies, but require the stronger 
condition that the players observe the control actually played by their opponent as well as the 
Brownian path (instead of the trajectory of SDE). The key point is that the players can never- 
theless deduce the resulting solution of the SDE: to show this we use the pathwise construction 
of stochastic integrals introduced by Nutz [5] . 

This note is divided into two parts: first we introduce the new notion of strategies and show 
the existence of a value and its characterization for a classical two-player zero-sum game with 
a final cost (to better explain our ideas, we have chosen to present our approach in this simple 
framework). The second part of the note is devoted to the erratum of the paper [2]. 

Acknowledgment : We are very much indebted with Rainer Buckdahn, who pointed out to 
us the flaw in the paper [2]. 

This work has been partially supported by the Commission of the European Communities 
under the 7-th Framework Programme Marie Curie Initial Training Networks Project SADCO, 
FP7-PEOPLE-2010-ITN, No 264735, and by the French National Research Agency ANR-10- 
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2 The classical stochastic differential game revisited. 

Let T > be a deterministic time horizon. For all t 6 [0,T], let fit be the set of continuous 
maps from [t, T] to TR d endowed with the a-algebra generated by the coordinate process and Pt, 
the Wiener measure on it. We denote by W the canonical process: W s {lo) = u(s). We introduce 
also the filtration F t = (J 7 ^ = a{W r — Wt,r € [t, s]}, completed by all null sets of Pt. 

For any t € [0,T] we denote by C°([t, T], TR N ) the set of continuous maps from [t, T] into 
1R N endowed with the sup norm and by Bt the associated Borel a— algebra. 

The dynamic of the game is given by 

dX s = f(X s ,u s ,v s )ds + a(X s ,u s ,v s )dW s ,s <E [t,T], 
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with u and v two F^-adapted processes with values in some compact metric spaces U and V. 
The process u (resp. v) represents the action of Player I (resp. Player II). We denote by X t,x,u ' v 
the solution of (|2.ip . 

Throughout the paper, the maps / : TR N X U X V ->■ M N and a : R N X U X V -»• IR Arxd 
are supposed to be bounded, continuous, Lipschitz continuous in (t, uniformly with respect 
to u, f . The sets U and V are compact subsets of finite dimensional spaces. We denote by Ut 
(resp. Vf) the set of measurable maps from [t,T] to U (resp. V), while U{t) (resp. V(t)) denotes 
the set of F 4 -adapted processes with values in U (resp. V). In what follows, the sets Ut and Vt 
are endowed with the L 1 -distance and the Borel <7-field generated by it. 

Definition 2.1. A strategy for Player I at time t is a nonanticipative, Borel-measurable map 
a : Qt x Vt — > Ut with delay: there exists 5 > such that, for any t' E [i,? 1 ], all (^1,^2) E V 2 
and for Pt-a.s. any (u)\, LO2) E ^f, «/ (^1, ^1) = (^2, ^2) on [i,£'] ; £/ien = ct(u2, V2) 

a.s. on [t, t' + S\. Strategies for Player II are defined in a symmetrical way. The set of strategies 
for Player I (resp. Player II) is denoted by A(t) (resp. B(t)). 

Let us point out that, for all a E A(t) and v E V(t), a(v) is a process and belongs to U(t). 
In the same way, for all f3 E B(t) and u E U{t), j3(u) belongs to V(t). We denote by Ud{t) the 
subset of U{t) of controls u E U(t) for which there exists some 5 > such that, (u s ) se [ t ,T] is 
adapted to := (J~t,( s -8)\/t)- The set Vd(t) is defined in a similar way. We remark that the 
elements of Ud{t) an d Vd{t) are predictable for the original titration Ft- 
Now we can state our fix point lemma: 

Lemma 2.2. For all t E [t, T], for all (a, j3) E A(t) x B{t), there exists a unique pair of controls 
(u,v) GUd{t) x Vd(t) which satisfies, P-a.s. 

u = a(v), v = p(u). (2.2) 

Proof. Let 5 > be a common delay for a and f3. We can choose 5 such that T = t + N5, for 
some N E IN*. We define on Q t , Mk (resp. V&) the set of F* -adapted processes on the time 
interval [t,t + kd) with values in U (resp. V). 

By definition, on [t,t + 5), the control a(u,v) does not depend on (u),v): we can set, for all 
(u), v ) E x Vj, a(cj, u) = no, where uo E Z^o (in fact ito is deterministic). And in the same way, 
there exists vq E Vo such that, for all (a?, u) E 0< x Ut, (3(uj,u) = vq. 

Assume now that, for some k E {0, . . . , N— 1}, there exists a pair (uk,Vk) GllkxVk such that, 
on [i, £ + k5), a(vk) = and /3(v,k) = Vk P-a.s. . We set, for all oj E Oj, Ufc +1 (w) = u^iuj) on 
[t, i + /c<5) and, since a is non anticipative with delay 6, it makes sense to set u^+i = a(oj, Vk{oj)) 
on [t + kd, t + (k + 1)<5). By assumption, and are adapted to F^ and a is nonanticipative. 
It follows that a u^+i is also adapted to F^. The process w^+i can be defined in a similar way. 

At the end it is sufficient to set (u,v) = (un,vn) to get the desired result. ■ 
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The main issue with our notion of strategies is that it is not clear that the observation of 
the brownian path and of the realized control of the opponent up to some time t' suffices to 
compute the position of the system at time t'. The following Lemma — which is the main point 
in our approach — says that this is actually the case. 

Lemma 2.3. Fix (t,x) G [0, T] x IR^. For all a.Q G A(t), there exists a measurable function 
F := F ao>t>x : (V t x fl t , B(V t ) ® P t ) -> (C°([t,T],M N ),B t ) such that, for alive V«j(t), 

F(v(-),-) =X t ' x ' ao ^'" P t -a.8. . 

Furthermore the map F is nonanticipative, in the sense that there exists Q,' t C Qt with Pt(£l' t ) = 1 
such that, for all uj,uj' G £l' t and v,v' G Vt, if (v,u) = (v',u)') a.s. on [t,t'], then, F(v,u) = 
F(v',oj') on [t,f]. 

Proof. Let v G Vd(t) and Pf the law on Vt x f2 t of (v, W) under P t . We endow the set Vt x f2 t 
with the following filtration : let Vt t> be the set of measurable maps from [t, t'] to V and set 
B t ,t> ={{veV t , v m £B},Be B(V t ,t>)}. Set P^ = f] (B t ^ ® J*,, V M p "), with W p " the 

set of null sets for the probability P v . Then (P£ s ,s G [t,t']) is a filtration satisfying the usual 
assumptions and in which {W s (v,u) := uj(s),s G is a Brownian motion. 

On the filtered probability space (Vt x Q t ,B(Vt) ®Pf>Pt\ {^tsi s £ we consider now the 

following SDE: 

J dX s = 6(X s ,a (v) s ,v s )ds + cr(X s ,ao(v) s ,v s )(iW s , s G [*,*'] > 
1 X t = x, 

with 

f W(u,w)s = w(s), 
[ v(u,o;) a = v(s). 

Then (|2.3p has a strong solution which law, under Ff on Vt x i^t coincides with the law 

of X*.*.oo(«).« under P t on 

We now apply the main Theorem of [5] to the above filtered space and the processes 
q _( s \ _/K4^ (v) ' V ,ao(v) S) v s ) \ 

We obtain that there exists a map F : V t X fl t ->■ C°([t, T], IR^) which is adapted with respect to 
the filtration (Pts)> such that, for all t? G Vd(i) and for all bounded test function </? : JR N x^-} 
IR, 

■),-)] = e![<p(f)} = eMx*/™)) = EMX*/'" ^)]. 

The nonanticipativity of F follows from the fact that F is adapted with respect to (Pt s )- ^ 
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We now define the value functions of the game. Given a bounded and Lipschitz continuous 
terminal cost g : IR^ — > IR, an initial position (t, x) € [0, T] x JR N and a pair of adapted controls 
(u,v) € U{t) x V(i), we define the cost function 

J(t,x,u,v) = E t g(Xj, x ' u,v ) 

It is well known that, for all pair of controls (u, v) G U{t) X V(t), (s,x) t- > J(s, x,u,v) is 
Lipschitz in x and Holder in s, uniformly in (u, v). It follows that, for all (a,v) S A(t) X V(t), 
(s,x) h4 J(s,x,a(v),v) is also Lipschitz continuous in cc and Holder continuous in s, uniformly 
in a and u. Furthermore, for all (t, t', x) € [0, T] 2 x IR^ with t < t', and e > there exists R > 
such that, if we denote by -Br(x) the ball in IR^ with radius R and center x, we have, for all 
(a,v) e A(t) x V(t), 

P t[ ^.«M.« eBfl(a . ))]>1 _ € . 

We introduce now the value functions of the game: for all (i, x) G [0,T] x IR^, we set 

F + (i,x) = inf sup J(t,x,a,(3), (2.4) 
ae-4(t) /3 G B(t) 

and 

V~(t,x)= sup inf J(t,x,a,(3). (2.5) 
/3eS(t) ae^l(t) 

It is clear that V~(t, x) < F + (i,x). Moreover we have the equivalent formulations 

V + (t,x)= inf sup J(t,x, a{v), v) and F _ (t,z)= sup inf J(t,x,u, (3(u)). 
aeA(t) v ev d {t) 0eB(t) ueu d (t) 

Proposition 2.4. The value functions V + and V~ are Lipschitz continuous in x, and Holder 
continuous in t. 

Proof. The proof is a straightforward consequence of the regularity of J. ■ 

Now we are able to establish a subdynamic programming principle. 

Proposition 2.5. For all x 6 IR^ and < to < t\ < T , the following subdynamic programming 
principle holds: 

V+(t ,x)< inf sup E^V+^xt ^' 1 ')]. (2.6) 

a£A(t ) veV d (t ) 

In particular, V + is a viscosity subsolution of the following Hamilton-, J acobi- Isaacs equation 
' V t + H + {D 2 V,DV,x,t) = 0, (t, x) G [0, T] x JR N , 

(2.7) 

V(T,x)=g(x), xGlR^, 
where H + [A, £, x, t) = inf sup ( -tr(aa* (i, x, u, v) A + (b(t , x, u, v), £} ) . 



Proof. Following [3] we set fit % x = {uj : [tojii] — > lR d continuous}. For uj G f2t , we define the 
pair (wi,w 2 ) € x ft fl by 



wi = w| [t0>tl ), w 2 = w|[ tl ,r| - w(ti). 
The map w i— > vt(oj) := allows to identify Qt with fito,^ x $7^ and we have p 



to 



P 



to,ti 



Ptu where P^,^ is the Wiener measure on f^,^. 



For u € V(to), we denote by t> 2 the restriction of u on [ii,T]. We further set {^(cji, oj 2 ) := 
u 2 (w) and remark that, if v G Vd(to), then (S 2 (u;i, •), wi G f^ ) is a family of processes which 
belongs to Vd(ti). 

Let us now denote by V(to,t\,xo) the right-hand side of (|2.6|) . We fix e > and consider 
ao G A(to) e-optimal for V(to, t\, xq): for all v G Vd(to), 



s to [F+(t 1 ,<°' a:( " aQ ^)]<y(to,ti,xo) + e . 



(2. 



Let 5 > be the delay of ao- We can suppose that 6 < e 2 A (ti — to). Let R > be such that, 
for all v G V d (*o), 



P 



to 



> 1-e. 



For K G IN large enough, let {Oo, . . . Ok} be a Borel partition of IR such that 0±, . . . , Ok 
have a radius smaller than e and Br(xq) C Uj^Ofc. Pick, for each A; G {1, . . . ,K}, x k G Ok 
and a k G .A(ti) e-optimal for V + (t\,x k ). We fix a G A(t\) some arbitrary strategy. By lemma 
73l there exists a measurable, nonanticipative map F :V to x Q to -> C ^) such that, for all 



u e Vd(t ), Po-a.s., X^ ,ao(v),v (u) = F(v(u),u)t>. We define a new strategy a e G A{t ) by 



a (u,w) S) if s G [io,*i), 

a fc (v 2 ,w 2 ) s , if s G [ti,T] andP(v,w) G O k , ke{0,...,K}. 



Set, for all k G {0, . . . , K}, A k = {x*'*?" ^" G O k } C J" to , tl . Recall that the sets {F(v) G O k } 
and ^4^ differs only by a Pt -null set, and that Pj (^4o) < e - 

Since y + is bounded, Lipschitz continuous in x and Holder in t, we get, for all v G V d (ii) 

if 



E, 



to 



to,xo,ao(v),v 



E. 



to 



^A k V + ( tl ,Xt° 



to,xo,ao(v),v 



fc=0 



> & 



to 



K 

£ 

,fc=i 



|^ + ||ooP (^o) (2-9) 



> £ 



to 



K 



,k=l 



Ce 



where C denotes a constant which changes from line to line. 
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Now let us come from the left hand side of f|2 . 6|) : For any v € Vd(to), we can write: 

K 



J(t ,x ,a e (v),v) = Eto 



, v tr\,xr\,otr)(v) ,v i, , v 



k=0 



(2.10) 



But, for all k £ {1, ... , K}, for Pf -allmost u £ O^ , we have 



E, 



to 



g( x T )k*i 



g(X T 



= J(t 1 ,X t t «> xo ' ao ^ v ,a k (v2),v 2 )(Lo). 

(2.11) 

Since J is Lipschitz continuous in x and Holder in t and a k is e-optimal for V + {t\,Xk), it holds 
that 



to 



J(ti,-X-J' So ' ao(,,) ' w ,a fc (u 2 ),« 2 )l < £ to [u fc (J(t 1 ,x,,a fc (T; 2 ), W2 ) + Ce)] 



(2.12) 



< £^[u fc (F+(t 1} a; fc ) + C€)]. 

Putting together (|2"^ ]) - (|2"7[2"]) . we get 

J (to, SO) oi e (v), v) < V(t , h,x ) + Ce. 

Taking the sup over v G Vd(to) then gives the result. 

The proof of the supersolution property from the subdynamic programming is standard (see 

S). ■ 

In a symmetrical way, we obtain a superdynamic programming principle for V~ and the fact 
that V~ is a subsolution: 

Proposition 2.6. For all x E IR and < to < t\ < T , the following superdynamic program- 
ming principle holds: 



V-(t ,x)> inf sup Etjy-(t u X%™ m 
peBfa) ueu d (to) 



)]■ 



Therefore V is a supersolution in viscosity sense of the following Hamilton-, J acobi- Isaacs equa- 
tion 

V t + H~(D 2 V,DV,x,t) = 0, (t,x) e[0,T]xM N , 

V(T,x)=g(x), xeJR N , 
where H~(A, £, x, t) = sup veV inf u€ u ^tr(aa*(t, x, u, v)A + (b(t, x, tt, • 

We can now follow [1] to obtain: 
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Theorem 2.7. Under Isaacs' condition: 



VA G S(N), £ G E , x G Hi , t G [0,T], F + (A(,x,t) = ir~(A,f,s,i) := H(A,£,x,t), 



3 Erratum to "Stochastic Differential Games with Asymmetric 
Information" [2j. 

The definition of strategy introduced in Definition [2J] is mainly motivated by a gap in the paper 
[2]. This paper deals with two-player, zero-sum differential games in which the players have a 
private information on the game. The flaw in the paper is not with this information issue, but 
with some technicalities arising with the notion of strategy developed there. 

In the framework of [2] a strategy for player I starting at time t is a Borel-measurable map a : 
[t, T] x C([t, T],M n ) -> U for which there exists 6 > such that, Vs G [t, T],f, f G C([t, T], IR n ), 
if / = /' on [t, s], then a(-, f) = a(-, /') on [t, (s + 6) A T]. With this definition, Player I only 
needs to observe the realization / of the solution of the stochastic differential equation. We can 
use this notion of strategy to define as in (|2.4p and (|2.5p the value functions V + and V~ via a 
fixed point argument very close to Lemma 12.21 The issue arises when one tries to prove that 
these value functions are Lipschitz continuous in space. Indeed, given a strategy a as above, 
an adapted control v G V(t) and two initial conditions x and x', there seems to be no way to 
built a new strategy a' such that the solutions X t,x,a ' v and X t,y ' a '' v are sufficiently close (in 
particular, the idea consisting in choosing a'(s, f) = a(s, f — x' + x) does not seem to work). 
As a consequence, there is a serious gap in the proof of Lemma 2.2 of [2]. 

In order to correct this, we have to change the notion of strategies and replace it with the 
one developed in the present note. This implies several changes, that we list below. 

1. As in section [21 we assume that we work on the Wiener space = C([t, T] , IR d ) endowed 
with the Wiener measure Pt and consider, for all initial time t G [0, T], the canonical 
process (B s (uj) = w(s),s G [t, T\). The filtration (Tt >s ,t < s) is the one generated by the 
canonical process. 

2. The definition of strategies (Definition 2.2 in [2]) must be replaced by the one in Definition 
12.11 This new notion of strategy must also be used in the definition of random strategies 
defined in [2], p. 5. 



the game has a value V + = V which is the unique solution in viscosity sense of 



V t + H(D 2 V, DV, x, t) = 0, (t, x) G [0, T] x IR^, 



(2.13) 
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3. The fixed point (Lemma 2.1 in [2]) has to be replaced by Lemma l2.2i 

4. The Lipschitz continuity of the value functions (Lemma 2.2 in [2]) becomes straightforward 
because one can now use the same strategy for different initial positions and get an estimate 
as in Proposition 12.41 

5. In the proof of Proposition 3.1 in [2], the construction of the strategy has to be modified 
as follows: by Lemma 12.31 there exists a measurable map F : U t x n -»• JR N , such that, 
for all u E Ut, it holds that 



Now, fori EL,l = (l ,...,l M ), we define {pff € B(t ) by: V(«,w) € C/ to xO to ,Vi £ [i ,T], 



1 /^ m '' m H [tl)T] ,u; 2 ) if t € [ti,T] and F{u,uo) € £? m . 

We set ^ := (Q3|)'; s l j,l £ L) £ B(t ), and finally /3 £ = (/3f , . . . , Then we can check 



6. In the proof of the Corollary 3.1 in [2j, the strategy has to be changed in the following 
way: we set f3o(v,u)t = vq for all (t,v,uS) € [to,T] x Vt x J) to . 
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